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VERSION WITH MARKINGS TO SHOW CHANGES MADE 
In the Specification: 

Paragraph beginning at line 28 of page 2 has been amended as follows: 

The present invention provides methods of modulating transcription in a plants. The 
methods comprise introducing into a plant a recombinant expression cassette comprising a promoter 
sequence operably linked to a heterologous polynucleotide sequence encoding a MYB polypeptide. 
A MYB polypeptide of the invention can be, for example, a polypeptide that is at least substantially 
identical to MYB polypeptides exemplified here (e.g., SEO ID N0S:2, 4, 6 or 8 SEC ID NOo: 2. 1, 
The polynucleotide can be, for example, SEO ID NOS:L 3. 5 or 7 SEO ID NO: L 3, 5 or 7 . 

Paragraph beginning at line 14 of page 4 has been amended as follows: 

The term "MYB polynucleotide" refers to a polynucleotides encoding a member of a 
class of transcription factors referred to here as "MYB polypeptides". MYB polypeptides are 
characterized by the presence of an amino-terminal DNA-binding domain, or DBD, consisting of 
two or three helix-tum-helix motifs of 51-52 amino acids (Rl, R2 and R3) that are highly conserved 
across phyla. MYB polypeptides may also comprise a transactivation domain. Exemplary MYB 
polypeptides are disclosed in SEO ID N0:1 SEO ID NO: 1 (GhMYB 1 GenBan k ATCC Accession 
No. L04497) and SEO ID N0:3 SEO ID NO: 3 (GhMYB 6 GenBan k ATCC Accession No. 
AF034134). Other useful sequences include sequences at GenBan k ATCC Nos. AF034130 
(GhMYB 2), AF034131 (GhMYB 3), AF034132 (GhMYB 4), and AF034133 (GhMYB 5). In 
addition, two other MYB nucleotide sequences are provided (GhMYB 7 and 8 (SEO ID NOS: 5 and 
7 SEQ ID Nog : 5 and 7) ^0ne of skill in the art will recognize that in light of the present disclosure, 
various modifications (e.g., substitutions, additions, and deletions) can be made to the MYB 
polypeptide sequences without substantially affecting their function. For example, the MYB 
polypeptides may contain functional domains from other porteins (e.g. related MYB polypeptides). 
These variations are within the scope of the term "MYB polypeptide". For example a MYB 
pQl-ypeptide-includes-the-sequences-exemplifiedJiere_as_well as polypeptides that are at least about 
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60%, usually at least about 70%, more usually at least about 80%, and often at least about 90% 
identical to the exemplified sequences. Also included are variant nucleic acid sequences that encode 
the same polypeptide as the exemplified sequences, i.e. sequences comprising degenerate sequences. 

Paragraph beginning at line 16 of page 1 1 has been amended as follows: 

Alternatively, the nucleic acids of interest can be amplified from nucleic acid samples 
using amplification techniques. For instance, polymerase chain reaction (PGR) technology can be 
used to amplify the sequences of genes directly from genomic DNA, from cDNA, from genomic 
libraries or cDNA libraries. PGR and other in vitro amplification methods may also be useftil, for 
example, to clone nucleic acid sequences that code for proteins to be expressed, to make nucleic 
acids to use as probes for detecting the presence of the desired mRNA in samples, for nucleic acid 
sequencing, or for other purposes. For a general overview of PGR, see PCR Protocols: A Guide to 
Methods and Applications, (frmis, M, Gelfand, D., Sninsky, J. and White, T., eds.), Academic Press, 
San Diego (1990). Appropriate primers and probes for identifying sequences from plant tissues are 
generated from comparisons of the sequences provided herein (e.g. SEP ID N0S:1 and 3 SEP ID 
NP: 1,SEQ IDNP:3 ). 

Paragraph beginning at line 2 of page 12 has been amended as follows: 

The isolated nucleic acid sequences prepared as described herein can be used in a 
number of techniques. For example, the isolated nucleic acids can be introduced into plants to 
enhance endogenous MYB gene expression and thereby increase expression of the genes whose 
expression is controlled by MYB polypeptides. A particularly useful gene for this purpose are the 
MYB genes shown in SEP ID NPS:1 and 3 SEP ID NP: L and 3 . 

Paragraph beginning at line 9 of page 21 has been amended as follows: 

An EcoRIIXhoI restriction fragment containing the Gil gene (Pppenheimer, D.G. et 
al, Cell 67:483-493 (1991)) was initially used to screen an unamphfied gtlO-3 dpa cotton ovule 
cDNA library (Wilkins, T.A. et al, Plant Physiol 102:679-680 (1993)). Because only one MYB 
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clone (GhMYBl) was recovered, a heterogeneous pool of homologous DNA probes spanning the 
conserved MYB DNA-binding domain (DBD) was generated by PGR for a second round of library 
screening. To amplify 159-bp 1 59 bp of the DBD, two degenerate 'universal' MYB primers, 
COT2Q GQ¥aQ ( 5'-GGNAARAGYTGYMGITTRAG-3': SEP ID N0:9 

5^ GGNi\i\R.\GYTGYMGITTR.\G 3") and C0T21 (3'-GGNCCKKCTTGTCTRTTRS-5'iSEQ 
ID NO: 10) were designed against the highly conserved stretches coding for peptides GKSCRL (SEP 
IDNP:in and PGRTDN (SEP ID NP: 12) , respectively. Using 25 |al of recombinant phage (3.4 x 
10^ pfu/|al^ fa/4) from the same unamplified library as the template, a 125 |al reaction was set-up 
containing Ix reaction buffer, 0.4 mM dNTPs, 0.5 fiM 0.5 :M of each CPT primer, and 0.04 units of 
Promega Taq DNA polvmerase/|j,l Polymoras e /:l , with the final concentration of 1 .5 mM MgCh 
provided by the phage storage buffer. PGR was performed in a thermal cycler (Ericomp) as follows: 
30 sec at 94^C-^4EG (1 cycle); 2 min at 92!C-^3BG, 2 min at 48°C-48EG. 2 min at 72!C-^^3BG (25 
cycles); and 10 min at 72°C 72EC (1 cycle). The 159-bp amplicon was cloned into the pT7Blue TA- 
cloning vector and transformed into E, coli NovaBlue competent cells (Novagen). Nucleotide 
sequencing (Sanger, F. et al, Proc Natl Acad Sci USA 74:5463-5467 (1977)) of 24 independent 
transformants identified a total of six different MYB DBDs. Equal amounts of the 159-bp amplicon 
from each of the six PGR clones, released by £co/?/ digestion, were combined in a heterogeneous 
pool of DBD sequences for use as a homologous hybridization probe. The pooled DNA probes were 
radiolabeled with [^^P1-dATP ^] dATP by random-primer labeling (Feinberg, A.P. et al, Anal 
Biochem 132:6-13 (1983)). Two sets of plaque Hfts containing 3-4 x 10^ recombinant phage from an 
amplified gtlO-3 dp a 3 dpa cotton ovule gtlO cDNA library (Wilkins, T. A. et al, Plant Physiol 
102:679-680 (1993)) were prepared using Hybond-N nylon membranes (Amersham). Both sets of 
plaque lifts were hybridized overnight at 42^C 4 2EC in 50% formamide buffer according to the 
membrane manufacturer's instructions. Lifts hybridized to the Arabidopsis AtMYBG/1 probe were 
washed in 2x SSC, 0. 1% SDS at 60°C 60EC for 30 m.in (moderate stringency), whereas lifts 
hybridized to the heterogeneous pool of cotton DBDs were washed in 0.2x SSC, 0.1% SDS at 60''C 
60BG (high stringency). DNA prepared from 15 purified recombinant-phage plaques (Sambrook, J. 
et aL, Molecular cloning: a laboratory manual Cold Spring Harbor Laboratory Press Cold Spring 
Harbor NY 2nd Edn. (1989)) were Ecd91 digested and electrophoresed in low melting point agarose. 
They were subcloned into pUCl 18 using the excised gel slices directly in the Ugation reaction 
J^\Mi\;]^BWecMiq^ — 
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nucleotide sequence (Sanger, F. et al, Proc Natl Acad Sci USA 74:5463-5467 (1977)) of both 
strands was determined for each of the 15 GhMYB isolates (designated A through 0), using 
Sequenase 2.0 (US Biochemical) or the Klenow fragment of DNA Polymerase I (GibcoBRL) in 
overlapping nested deletions of single-stranded DNA as templates (Dale, R. et al, Methods Enzymol 
155:205-231 (1987)). To comply with standard nomenclature for R2R3-MYB genes, the six unique 
cotton MYB clones A, D, G, J.NaadO were renamed and numbered sequentially as GhMYBl 
through GhMYB6, respectively. 

Paragraph beginning at line 28 of page 22 has been amended as follows: 

Genomic DNA was extracted from young expanding leaves of G. hirsutum L. as 
described previously (Wilkins, T.A. et al. Theor Appl Genet 89:514-524 (1994)). • CsCI-purified 
DNA (20 fxel (20 :e) was completely digested with EcoRI, Hindlll or Hindll, fractionated in 1% 
agarose gels at 1.8 V/cm for 12-15 h, and transferred to Zetabind nylon membrane (Cmo, Meriden, 
CT) by capillary blotting in lOx SSC transfer buffer. Afterwards, the DNA was affixed to the 
membrane matrix by UV cross-linking. Membrane hybridization was performed at 65 °C 6SEC in 
5x SSPE, 7% SDS, and 250 ^ig/ml 250 :g/ml of sheared, denatured sahnon sperm DNA. Gene- 
specific hybridization probes were generated by PGR, using oligonucleotide primer-pairs designed 
against unique nucleotide sequences of the six cotton MYB cDNAs (GhMYBI-6). The primers 
(Operon Technologies), with respective sizes of amplified fragments were: COT 105 ( 5'- 
AAGCAGAGGAATTGATCCAC-3': SEP ID NO: 13) (5- AAGCAGAGGi\ATTGA TCCAC 3-) 
X COT106 r3'-CTGGGAACCTAAGTATCCCA-5': SEP ID N0:14V ^ 
CTGGGi\i\CCTi\i\GTATCCCA 5") , 538 bp; CPT107 (5'-CCTCGGAACAAATTGTGCC-3': 
SEP ID NP:15) (5- CCTCGGAACA.\i\TTGTGCC 3-) x COT108 Q^z 
GCCTTCCAACGAAACCAAACC-5': SEO ID NO: 16) (3- GCCTTCCi\ACGi\i\ACCAi\ACC 
5=), 153 bp; COT109 f5'-CAGAAGGAGAAACACAGAGG-3': SEO ID N0:17V ^ 
rAr,A AfiGAfiA AArACAGAGG 3-1 x COTl 10 r3'-GGCTGTATCACTTGACATCG-5': SEO ID 
NO: 18) f3- GGCTGTATCACTTGACA TCG 5-) , 412 bp; COTl 1 1 £5!; 

CC ATTAACTCAAAGCATGCC-3': SEO ID NO: 19) (5- CCATT.\ACTCi\Ai\GCATGCC 3-) x 
COTl 12 (3'-CGAGGAGGAACAAGGAGGAC-5': SEO ID NO:20)- (^^e^ 
_CGAGGAGGi\i\C\i\G GAGGAC 5-) . 861 bp; COTl 13 f5'-AGTCCAGAAGCAGGCCAAGC-3^ 
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SEP ID N0:2n (5- AGTCCAGAAGCAGGCCi\i\GC 3-) x COTl 14 O!; 
GGTGTACTTAAGCATTAGCA-5': SEP ID NO:22) (3- GGTGTACTTi\i\GCATTAGCA 5-) . 
545 bp; and CPT 115 (5'-CACCGCCCACTGGTATATCC-3': SEP ID NG:23H #=- 
PArrr . rrr ArTGGTATATrC in x CPT 1 16 r3'-CCGTTGTACGTGCGGTAACA-5': SEP ID 
NPi24 }(3' CCGTTGTACGTGCGGTAi\CA 5-) . 243 bp. The concentrations of the PGR 
components in a 25 \i\ 25 :1 reaction were Ix synthesis buffer, 1 .5 mM MgCb, Q,2jxM 0.2 :M of 
each primer, 0.2 mM dNTPs, and 0.04 units of Tag polvmerase/^l polvmoraoo/:! . The temperature 
cycling conditions included the same denaturing and extension steps as stated previously, but 30 
cycles of 1 min- 4-mffl at 94°C- 9^G. 1 min- 4-aHa at 48°C- 48E€. and 1 min at 74°C^ ?4BG were used 
instead. The resulting PGR products of the expected molecular weight were purified using 
Promega's Magic® PGR columns, cloned into the pCR^" II T- vector (Invitrogen) and transformed 
into E. coli E. co/i MVl 190. These PGR fragments were radiolabeled and used as gene-specific 
hybridization probes. Blots were washed under high stringency conditions in O.lx SSC, 0.1% SDS at 
65°C 6SEC for 30rnin-30^»H» prior to autoradiography. 

Paragraph beginning at line 26 of page 23 has been amended as follows: 

Semi-quantitative RT-PCR was performed with minor modifications (An, Y-Q. et al. 
Plant Cell 8:5-30 (1996)) using total RNA isolated from various tissues and developing ovules by 
the procedure of Wan, C-Y et al, AnalBiochem 223:7-12 (1994). First-strand cDNA synthesis was 
performed using 1.5 n g 1.5 :g of total RNA as the template according to recommendations in the 3' 
RACE kit (GibcoBRL). The amount of cDNA synthesized, as determined by spectrophotometer and 
fluorometer measurements, indicated cDNA yields ranging from 3 to 6 x 10^ ng per RT reaction. 
Semi-quantitative PGR was performed in two sequential amplification steps using ten-fold serial 
dilutions (lO'V.lO'^ and 10'^) from the same amount of first-strand cDNA (500 pg). Attempts to 
include more than one primer-pair in a given reaction produced inconsistent results (data not shown), 
therefore expression analysis was performed independently for each GhMYB gene. PGR products 
could not be detected in dilutions greater than 10"^ by agarose gel electrophoresis. In the first round 
of amplification, PGR was performed using the universal MYB primer GGT2P and the universal AP 
primer provided with the 3' RAGE kit. The volume of each 10-fold RT-dilution step used as the 
4ejriplatR^was-R-%-ofJheJinal PCRjmlmne._In.the amplification, gene-speci fic primer- pairs 
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(see above and Fig. lb) were employed, using 8% of the previous PGR reaction as the template. The 
two sequential rounds of PGR amplification included Ix reaction buffer, 0.2 mM dNTPs, 0.2 [xM- ^ 
tM for each primer, and 0.05 units of Tag DNA polvmerase/|al polvmoras e /:! of the reaction. 
Temperature cycling conditions were the same as described above for gene-specific amplifications. 
PGR products fi-om the second amplification were resolved in 1% agarose gels stained with ethidium 
bromide and visualized using a IS 1000 still-video imaging system (Alpha Innotech). Recorded 
images were stored as TIF files. At least three replicated experiments were performed for each 
gene-specific primer-pair. As a reference, a 300-bp portion of the vacuolar H"^-ATPase catalytic 
subunit was amplified by the primer-pair [G0T8 X G0T9] fWilkins et al. 1994 ) fWilkina ct a!.. 
199 4 ) under the same conditions to verify that the tissue-specific distribution and developmental 
profile produced by semi-quantitative RT-PGR was consistent with the expression pattern produced 
by northern blot analysis or ribonuclease protection assays (Smart, L.B. et al. Plant Physiol 
116:1539-1549(1998)). 

Paragraph beginning at line 26 of page 24 has been amended as follows: 



from ovules at -3 dpa, the stage at which trichome primordia are fully potentiated to develop 
(Wilkins, T.A. et al. In Basra AS (ed) Cotton Fibers. Food Products Press New York (1999)). 
Gomplete sequence analysis of the 15 ovule cDNA clones revealed the presence of six distinct MYB- 
domain genes, designated as GhMYBl through GhMYB 6 (Fig. 1) , based on both the nucleotide 
variation within the DBD and the presence of imique G-terminal domains. Among the clones 
characterized, three appeared to be closely related members oi GhMYBl, -3 and -6, while the 
remainder of the 15 clones represented additional full-length or truncated versions of the six 
GhMYBs. Not surprisingly, the region spanning the amino-terminal DBD was very highly 
conserved among all six GhMYBs, with amino acid identities/similarities ranging firom a low of 
54.8/16.4% (GhMYB5 vs. GhMYB6) to a high of 84.6/1 1.5% (GhMYBl vs. GhMYB6). Structural 
similarities among the cotton and other plant MYBs includes a typical R2/R3 repeat, the tryptophan 
hydrophobic core and conserved DNA base-contacting residues that function in recognition 
specificity (Martin, G. et al, Trends in Genet 13:67-73 (1997); Ogata, K. et al, Cell 79:639-648 
(1994)). For descriptive purposes of this work, the entire G-terminal region downstream of the DBD 



Fifteen cDNA clones encoding cotton MTB-domain (GhMYB) genes were isolated 
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was designated as the transcriptional (trans-) regulatory region, or TRR. We found this designation 
appropriate, considering that (i) both the TAD and NRD are located in this region in animals, (ii) the 
relative position of the TAD within the MYB C-terminal region may vary, and (iii) the TRR region 
varies considerably in both the number and composition of amino acids in a MYB-specific, even 
within a given species (Avila, J. et al, Plant J 3:553-562 (1993); Jackson, D. et al., Plant Cell 
3:115-125 (1991); Li, S.F. etal. Plant J S:963-97 2 (1995)); and (iv} the number and type f 
conserved motifs varies from MYB-to-MYB. Apart from a few conserved motifs, each GhMYB 
TRR is unique, ranging in size and amino acid composition, and in the location of putative leucine- 
zipper structures. 

Paragraph beginning at line 18 of page 25 has been amended as follows: 



regulatory aspects of MYBs. First of all with the noted exception of GhMYB2, the remaining 5 
GhMYBs contain a conserved stretch of 40 amino acids with a positive net charge (basic pi) in the 
5'-portion of the TRR proximal to the DBD. To our knowledge, this is the first clear description of 
such a basic domain outside of the DBD, and consequently led to the subdivision of the TRR into a 
basic 'transregulatory region 1' (TRRl), and an acidic 'transregulatory region 2' (TRR2) in these 
MYBs. Secondly, the presence of a conserved GIDPxxH (SEP ID NO:25') motif was noted within 
the TRRl of GhMYB 1 and GhMYB6, and is located precisely 12 amino acids downstream of the 
last tryptophan of the R3 repeat. When present, the GIDPxxH (SEP ID NO:25) motif is found in 
exactly the same position of other plant MYBs, irrespective of whether there is a TRRl domain or 
not (Jackson, D. et al., Plant Cell 3:1 15-125 (1991); Li, S.F. et al., Plant J S:963-972 (1995); Lin, Q. 
et al., Plant Mol Biol 30:1009-1020 (1996); Marocco, A. et al, Mol Gen Genet 216:183-187 (1989); 
Wissenbach, M. et al, Plant J 4:4\ 1-422 (1993)). Analysis of the six GhMYBs and 24 additional 
plant MYB sequences therefore established that there was no apparent association between the 
presence, or absence, of this motif and TRRl. Thus, plant MYBs having a TRRl may (GhMYBl 
and -6) or may not (GhMYB3, -4 and -5) necessarily contain the GIDPxxH rSEPIDNP:25) motif 
Third GhMYBl possesses a cysteine-rich zinc-finger motif rCXiCX.nCX^C (SEP ID NG:26') . where 
X = any amino acid; Chopra, S. et al. Plant Cell 8:149-1 158 (1996)) near the carboxyl-terminus of 
the TRR, which confers a unique protein structure to this GhMYB by the presence of two potentially 



Interesting structural features were identified that may have implications on 
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functional DNA-binding domains within a single polypeptide. Finally, the presence of small 5'- 
upstream open reading frames (5'-uORFs) located within the 5 '-untranslated region of GhMYB4 and 
-5 are worth noting, since such uORFs have been shown to drastically interfere with the level of 
translation of the correct ORF of other transcription factors (Damiani, R.D. et al, Proc Natl Acad 
Sci USA 90:8244-8248 (1993); Lohmer, S. et a/., Plant Cell 5:65-73 (1993)). 

Paragraph beginning at line 28 of page 27 has been amended as follows: 

A semi-quantitative reverse transcription-polymerase chain reaction (RT-PCR) 
method (An, Y-Q. et al, Plant Cell 8:5-30 (1996)) was employed using TRR gene-specific primer- 
pairs to determine the expression pattern and relative abundance of individual GhMYB transcripts. 
In control experiments, oligonucleotide primers did not amplify non-specific PGR products. As a 
point of reference, the transcript profile of the vacuolar H^-ATPas c H''' ATPoo e catalytic subunit 
produced by RT-PCR from the RNA source used to amplify GhMYB mRNAs was identical to 
published expression patterns detailing the spatial and temporal regulation of this gene (Hasenfratz, 
M-P etal. Plant Physiol 108:1395-1404 (1995); Smart, L.B. et al, Plant Physiol 116:1539-1549 
(1998)). A serial dilution of each RT reaction (10^ 10"\ 10*^ and 10"^) was performed with a fixed 
amount of first-strand cDNA to restrict the availability of template during PGR ampUfication. By 
doing so, only the more abundant messages can be detected at higher dilutions. 

Paragraph beginning at line 6 of page 28 has been amended as follows: 

To evaluate the spatial pattern oi GhMYB gene expression, semi-quantitative RT-PGR 
analysis was performed using total RNA from several organs and tissues. Expression of GhJMYl, -2 
and -5 was detected at the 10"^ dilution in the all tissues and organs tested, including developing 
cotton fibers, although the relative abundance for these GhMYB transcripts was considerably lower 
in pollen and stigmas. In contrast, expression of GhMYB4, -5 and -6 varied considerably in relative 
abundance and was spatially regulated in a tissue-specific manner. To distinguish between the two 
expression patterns, the broad distribution of GhMYB 1, -2 and -3 transcripts was termed as type I, 
whereas type II referred to the tissue-specific pattern of expression exhibited by GhMYB4, -5, and -(5, 
which includes the absence of transcripts in stigmatic tissue. GhMYB4 is preferentially expressed in 
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ovules since mRNA was strongly detected at the 10""^ dilution in ovules + fibers, but only at < 10"^ in 
isolated fibers, indicating at least a 10- to 100-fold difference in transcript abundance. Similarly. 
GhMYB4 transcripts were present in roots, leaves, and petals, but in lower abundance (<10'^) than 
observed in ovules. GhMYB4 expression was not detected whatsoever in bracts, pollen, anthers or 
embryos (10^ dilution). GhMYBS mRNA, on the other hand, was clearly detected in bracts, and to a 
lesser extent in petals and anthers, but was in low abundance in roots, leaves, ovules, + fibers (10" 
dilution), and was barely detectable in pollen (10*^ dilution). GhMYBS transcripts in petals were of 
slightly lower molecular weight than expected in all experiments. Since this phenomenon has been 
observed to selectively occur in floral tissues using unrelated primers (Hasenfi-atz, M-P et al, Plant 
Physiol 108: 1395-1404 (1995)), one possible explanation for the difference in transcript size is the 
tissue-specific use of alternative poly(A) signals or alternative splicing, although amplification of a 
PGR artifact cannot be totally excluded. GhMYBS transcripts were strongly detected in roots, bracts, 
petals, anthers, ovules + fibers, and fibers, and to a lesser degree in leaves and embryos at the 10" 
dilution. GhMYB6 mRNA in pollen was detected at dilutions <10"^ (Fig. la) . Although the spatial 
distribution and transcript abundance of GhMYB6 shared characteristics of both type I and type 11 
expression patterns, and was therefore somewhat intermediate between the two types, GhMYB6 was 
classified as type II for the time being. 



Paragraph beginning at line 24 of page 30 has been amended as follows: 



As expected, the R2R3 structure of the DNA-binding domain of the six newly 
identified cotton MYB genes (GhMYB) is highly conserved, whereas the amino acid sequence of the 
C-terminal domain, termed the transcriptional regulatory region (TRR), is highly variable. The TRR 
of GhMYB4 is considerably longer than the average plant MYB resulting in a R2R3-Myb factor of 
unusual molecular weight (50.8 kD). Beyond the differences in length and composition of the TRR, 
several other interesting structural features that may influence the target gene specificity of GhMYBs 
warrant ftirther discussion. GhMYB 1 in particular, is one distinct example in that it contains a 
cysteine-rich domain similar to a zinc-finger motif, CX1CX10CX2C (SEP ID NO:26) (Chopra, S. et 
aL, Plant Cell 8:149-1 158 (1996)). This MYB DBD/zinc-finger combination was first reported in a 
P allele from maize and one which exhibits a distinct tissue-specific pattern of expression (Chopra, 
S. et al, Plant Cell 8:149-1 158 (1996)). A suggested in the maize study, the presence of two 
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potentially functional DNA-binding domains within a single polypeptide may confer a unique means 
for modulating gene expression, although this supposition has yet to be tested in either species. 

Paragraph beginning at line 5 of page 31 has been amended as follows: 

A second noteworthy feature of R2R3-MYBs identified in this study is a basic 40- 
amino acid region of the TRR, designated as TRRl to distinguish this subdomain from the acidic 
portion (TRR2) of the C-terminus (Fig. lb. Table 1 . The TRRl which is located immediately 
downstream of the DBD, is found in a subset of plant MYBs or about 50% of the R2R3-MYBs 
examined so far (Avila, J. et al, Plant J 3:553-562 (1993); Cone, K.C. et al. Plant Cell 5:1795-1805 
(1993); Grotewold, E. etal. Proc Natl Acad Sci USA 88:4587-4591 (1991): Jackson, D. et al. Plant 
Ce// 3:115-125 (1991); Li, S.F. etal. Plant J S:963-972 (1995); Lin, Q. etal. Plant Mol Biol 
30:1009-1020 (1996); Noda, K-I et al. Nature 369:661-664 (1994); Urao, T. et al. Plant J 10:1145- 
1 148 (1996); Wissenbach, M. et al. Plant J 4:411-422 (1993)). Although the functional 
significance of TRRl is not known, one might reasonably infer from the basic nature of this region 
that the TRRl subdomain plays a role in modulating the interaction with DNA molecules (Mitchell, 
P.J. et al. Science 245:371-378 (1989)). The fact that GhMYB2 and many other plant MYBs, lack a 
TRRl subdomain would suggest a degree of specialization for these particular MYB proteins in 
terms of DNA-binding affinity and/or sequence recognition. It was also noted that the conserved 
motif, GTDPxx H TSEO ID NQ:25'> . is present within the TRRl domain of GhMYBl and GhMYB6 
and is located precisely 12 amino acids away from the last tryptophan of the R3 repeat in both 
proteins-^4gr4b). Interestingly, while the TRRl-GIDPxxH (SEP ID NQ:25) combination identified 
in GhMYBl and GhMYB6 is conserved in MIXTA (AmMYBMx). Glabrousl (AtMYBGIl) does 
not contain either a TRRl or GIDPxxH (SEP ID NO:25) motif and is therefore structurally 
analogous to GhMYB2. Taken together, the combined diversity of DBD, TRRl and TRR2 domains, 
plus the presence/absence of particular motifs, may endow GhMYBs with greater flexibility in the 
formation of functional transcription complexes (Ess, K.C. et al, Mol Cell Biol 15:5707-5715 
(1995); Kanei-Ishii, C. et al. J Biol Chem 269:15768-15775 (1994); Pelgeschlager, M. et al. EMBO 
715:2771-2780(1996)). 



Parag raph beginning at Une 31 of page 31 has been amended as follows: 
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A third interesting structxiral property that Ukely modulates expression of GhMYB4 
and GhMYBS at the translational level is the presence of 5'-uORFs (Fig. lo) . The effect of these 
small upstream open reading frames on translation is well known - resulting in decreased synthesis 
of the major polypeptide by interfering with re-initiation of translation at downstream start condons 
(Damiani, R.D. etaL Proc Natl Acad Sci USA 90:8244-8248 (1993); Lohmer, S. et al. Plant Cell 
5:65-73 (1993)). For GhMYBS in particular, the AUG initiation context (Dasso, M.C. et al, Eur J 
Biochem 187:361-371 (1990); Gallic, D.R. Annu Rev Plant Physiol Plant Mol Biol 44:77-105 
(1993)) for a 5'-uORF is stronger relative to what is presumably the main ORFs (data not ohown) , 
suggesting that the rate of translation may be very low. Moreover, the initiation codon of GhMYBS' s 
5'uORF is separated from the main start codon by only a single nucleotide, which generates two 
overlapping ORFs (Fig. Ic) . The physical proximity of AUGs may conceivably generate 
competition between the two ORFs for ribosomal machinery, thus affecting the rate of GhMYB5 
synthesis. The physical proximity of AUGs may conceivably generate competition between the two 
ORFs for ribosomal machinery, thus affecting the rate of GhMYB5 synthesis. 

Paragraph beginning at line 13 of page 32 has been amended as follows: 

Based on multiple sequence alignment algorithms, the DBD amino acid sequences of 
cotton MYB proteins show structural similarity to plant R2R3-MYB factors implicated in 
phenylpropanoid biosynthesis-(FigT-3). Based on the extensive phylogenetic analysis performed by 
Romero, L. et al, Plant J 14:273-284 (1998), and which also includes GhMYBl (formerly 
GhMYBA), GhMYB proteins invariably belong to group C and therefore likely exhibit a preference 
for type IIG DNA-binding sites. GhMYB5 is the most distantly related cotton R2R3-MYB and is 
found in an isolated cluster that includes the drought-inducible AtMYB2 (Urao et al, 1996). Amino 
acid comparisons of DBD and TRR domains from GhMYBs MD(TA (AmMYBMx) and Gil 
(AtMYBGIl) did not reveal any striking similarity beyond conserved motifs (Tabl e 2) . However, 
based on established DBD structural similarities to other R2R3-MYB factors (Fig. 2) , GhMYB2, 
GhMYB3, and GHMYB4 are members of a phylogenetic group that contains Glabrous 1, while 
GhMYBl and GhMYB6 belong to a closely related cluster (Romero, L. et al. Plant J 14:213-2S4 
(1998)). Considering that both Arabidopsis leaf and cotton seed trichomes are single cells, the 
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phylogenetic relationship between Glabrous 1 and GhMYBs is especially intriguing from a 
functional standpoint. 



Paragraph beginning at line 29 of page 32 has been amended as follows: 

The general trend found among plant species is for a large number oiMYB genes 
(Romero, L. et aL P/an/ 7 14:273-284 (1998); Lin, Q. etaL, Plant Mol Biol 30:1009 A020 (1996); 
Solano, R. et al, Plant J S:673-6S2 (1995b)). Genomic Southem blots hybridized with GhMYB 
DBDs (data not shown) established that this is also the case in cotton. Yet, DNA blots hybridized 
with gene-specific probes (Fig. 2) indicate each GhMYB class is encoded by a small gene family 
consisting of only a relatively few number of genes. The DNA blots probably represent the 
minimum number of genes belonging to each family, since it is a strong possibility that the 
hybridization pattem does not distinguish alloalleles derived from of the AA and DD genomes of G. 
hirsutum (Wilkins, T.A. et al, TheorAppl Genet 89:514-524 (1994)). However, the increase in 
ploidy level likely results in a simple amplification of the number of genes in each family. Thus, the 
six novel GhMYB genes identified in this study represent only a small subset of the MYB genes 
encoded by cotton genome. 



Paragraph beginning at hne 8 of page 33 has been amended as follows: 

A semi-quantitative RT-PCR approach (An, Y-Q. et aL, Plant Cell 8:5-30 (1996)) 
proved to be a key means to characterizing the differential expression pattems for each of the six 
GhMYB genes since transcript abundance was too low to be detected by conventional RNA blot 
analysis. RT-PCR experiments (Fig. 3) revealed that the spatial and temporal regulation of all six 
GhMYBs form two distinct pattems of gene expression. Type I GhMYB {GhMYB 1, -2 and -5) 
transcripts were more abundant than type II genes and were found in all tissue-types examined, 
suggesting that type I cotton MYBs regulate cellular fimctions common to all these tissues. In 
contrast, type II cotton GhMYBs, {GhMYB4 -5, and -6) are not only spatially and temporally 
modulated to a greater degree than type I GhMYBs-^Pi^r4), these mRNAs are much less abundant 
than type I messages since type II transcripts are detected only at the lower RT dilutions for the most 
part. As suggested previously (Jackson, D. et al. Plant Cell 3:11 5-125 (1991); Larkin, J.C. et al, 
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Plant Cell 5:1739-1748 (1993); Solano, R, et al„ EMBO J 14:m3-m4 (1995a)) for greater other 
plant MYB genes, the spatial and temporal modulation of type II MTB^ suggests that these MYBs 
may be involved in modulating the fine-tuned control of specific cellular functions. Interestingly, 
the distinct pattern of transcript accumulation observed for GhMYB4 and GhMYBS and the presence 
of putative 5'-uORFs in the 5 '-untranslated region suggests that the spatial and temporal expression 
of these type II cotton MYBs are subject to complex regulation at both the transcriptional and post- 
transcriptional levels. 
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